home *** CD-ROM | disk | FTP | other *** search
Text File | 1988-05-05 | 76.5 KB | 2,351 lines |
- General Introduction
-
- AnyWord, formerly called "Total Recall", is a
- general text indexing and retrieval program. It
- consists of two parts: Indexer and Finder. When
- Indexer is run against a set of text files, it
- produces an index much like the index of a book.
- These index files can then be read by Finder to
- quickly locate any word or combination of words in
- the text.
-
- Again, like a book index, your text is untouched.
- All of the indexing information is carried
- separately and your text files are not modified or
- re-arranged in any way. This means that you can
- "play around" with AnyWord as much as you want
- to without the worry of making an inadvertent change
- to your text files.
-
- AnyWord assumes nothing about the placement
- of your text files or the organization, if any, of
- your text. Your text data can be in any format or
- no format at all; your files can be any size, on any
- disk, with any names. Of course, there are
- practical limits. This version of AnyWord will
- probably max out at somewhat over 1 MB (1 million
- bytes) of text spread across 254 files.
-
- You supply information about your files through
- the profile file, used by both Indexer and Finder.
-
- Some AnyWord features:
- - searching by any word or combination of words
- - display of selected text
- - extraction of selected text into new files
- - searching with simple Boolean logic,
- i.e., AND - OR - NOT
- - searching with "wild cards", e.g., comput? for
- references to computers,computing, etc.
- - exit to DOS and return without disturbing selected
- text or search results
- - review of previous search results within a session
- - view of all of the words that have been indexed
- - automatic file location via the DOS PATH operation
-
- What AnyWord cannot do:
- - make sense out of special formatting codes within
- word processing files. AnyWord works best
- with ordinary ASCII files. It will also work well
- with Wordstar files in document mode. It will
- work less well, though it will still operate, with
- files that contain binary data, such as Microsoft
- Word files. Most word processing programs have an
- option to save a document as an ordinary ASCII
- text file -- you'll probably want to take
- advantage of this.
- - any kind of numeric calculation or comparison. To
- AnyWord everything is text -- just text.
-
-
- AnyWord is useful for:
- - Locating simple information. If you had our Movie
- Database, for instance, you could instantly locate
- all of Debra Winger's movies or director John
- Carpenter's movies.
- - Locating combinations of text. Using the Movie
- Database as an example again, you could look for
- all of the movies Clint Eastwood made with
- Sondra Locke during the 1970s.
- - Extracting chunks of text into new files. Have
- a lot of research notes that you want to
- organize? You can build new files out of segments
- of the old ones -- and without changing any
- existing text.
- - Rapidly browsing through a collection of files.
- From any point within any file, you can page
- backwards, forwards, across files, to the
- beginning, to the end, etc.
- - and more.
-
- AnyWord was formerly sold as a commercial
- product (for $49.50) under the name "Total Recall".
- You may still see occasional references to that
- name -- but AnyWord and the old Total Recall are
- the same. (We sold our trademark and now sell
- a different product called Total Recall.)
-
- Now read on to find out how to get started.
- How to Use This Manual
-
- If you're in a hurry to get started, read the
- Installation section and the two "Getting Started"
- sections. These should provide enough information
- for you to index some text files and do rapid
- lookups on the words in those files.
-
- However, if you believe that: "If you don't have
- time to do it right, when will you have time to do
- it over?", then skim through the manual completely.
- If you do this, you'll understand how the system
- works, what you can expect from it, and what it
- expects from you. Then, try the system out, using
- the manual for reference.
-
- AnyWord is a fairly complex, powerful
- system. It probably has far more features than
- you'll ever use, but if you acquaint yourself with
- what those features are, you'll be less likely to
- spend much time scratching your head trying to
- figure out how to get the system to do what you want
- it to do.
- Installation
-
- First, look on the program disk. If there is a
- file called README.1ST, then do so. (E.g., enter:
- type readme.1st)
-
- For Hard Disk Users:
- 1. Copy all of the files over to your hard disk.
- Recommended: create one subdirectory for
- everything on the AnyWord disk.
- For instance:
- C> mkdir \anyword
- insert the AnyWord disk into the A: drive
- C> copy a:*.* \anyword
-
- Alternatively, you can copy FINDER.EXE and
- INDEXER.EXE to a subdirectory in your DOS
- execution PATH.
-
- For floppy-disk users:
- 1. Make back-up copies of the distribution disks
- and store the originals in a safe place.
- 2. If you have a two-disk system, verify that you
- have an AUTOEXEC.BAT file on the diskette that
- you normally boot up with. Check that it
- contains:
- PATH=A:\;B:\
- (The backslashes are required for AnyWord.)
- AnyWord (optionally) uses the DOS PATH to
- find files, so specifying both A & B will allow
- you to put your files in either drive.
-
- For all users:
- 1. Verify that you have a CONFIG.SYS file in your
- root subdirectory or on the diskette that you
- boot up with. If you do not have a CONFIG.SYS
- file, then you must create one. The
- instructions to do so are in your DOS manual,
- but all that is actually required is that you
- create a file with this name and put in the
- ASCII text lines:
- files=20
- buffers=24
-
- 2. If you already have a CONFIG.SYS file, then
- verify that you have specified files=20 (or some
- higher number.) If not, then add it.
- Recommended for better performance of DOS in
- general:
- buffers=24 (or some other number between 20
- and 30)
- Getting Started with Finder
-
- If you have not already done so, copy the files
- on your distribution diskette(s) onto your hard disk
- or onto backup floppy disks.
-
- Finder requires a "profile" file for all file
- collections, indexed or not. This profile file
- contains information that Finder needs to run. For
- The Movie Database, the file is PROFILE.MOV.
- A sample profile, PROFILE.SMP is also included to
- help you create your own profiles. (The simplest
- way to proceed on this is to make changes
- to a copy of one of our profiles.) A profile
- file must be specified each time you run
- Finder, so the command from DOS to run Finder can
- be:
- FINDER PROFILE.MOV
- or, more generically:
- FINDER profilename.ext
-
- Before starting Finder, verify that the contents
- of F-FILES.MOV (or F-FILES.ext) specifies the
- correct drive and/or path name for each of your text
- files. THe F-FILES file is normally created by
- Indexer and no further changes are necessary.
- However, if you have moved your files since Indexer
- was run, then you may have to correct their
- directory or drive designation in F-FILES. (If your
- text files are in a subdirectory that has been
- specified with the DOS PATH command, then a drive or
- path name may not be required.)
-
- Once in Finder, you will find that many of the
- commands are self-explanatory. There is also a Help
- file that you can access. Keep in mind that you can
- try almost anything without worrying about making
- inadvertent changes to your text files, because
- Finder reads but does not write to your text files.
- The only exception to this try-anything rule is
- swapping diskettes in the midst of a Finder session.
- This is only permissible in certain circumstances.
- More on that in the "Files" section.
-
- The key to getting the most out of Finder is in
- controlling the Finder environment with either the
- profile or the [M]ode command. For more on that,
- see the section "Controlling the Environment".
-
- Just to get you started, try this:
- ([ENTER] means press the Enter/Return key)
- finder profile.mov [ENTER]
- s
- John Wayne [ENTER]
- d [ENTER]
- AnyWord will find all references to "John Wayne"
- Each time you press: d [ENTER] you'll
- see the next reference, until there are no more.
- Use the Home, End, PgUp and PgDn keys to browse
- around in the file. (The files you're looking at
- are LF, LN, and LZ.)
-
- There are options on the search and display
- commands, as well as other commands, but you'll
- probably be able to retrieve much of what you
- want by just knowing a few simple operations.
-
- Let's try another search, with more explanation.
-
- S
- - to begin another search. Finder will ask
- you what you want to search for.
-
- 1962[ENTER]
- - to search for all references to
- "1962" in the database.
- Finder will report 6 references.
-
- D
- - to display the results of your search. Finder
- will ask you which reference you want to look at.
-
- 2/1[ENTER]
- - to look at the first reference in your
- search, which was the second search you did.
- Finder will display the first movie it
- found.
-
- D
- - to do another display. Finder will ask you
- which one.
-
- 2/2[ENTER]
- - to look at the next movie.
-
- D
- - to do another display.
-
- [ENTER]
- - will automatically display the next
- reference, in this case, set 2, element 3.
-
- S
- - to do another search.
-
- Diane Keaton[ENTER]
- - to find Diane's movies.
-
- D
- - to look at the results.
-
- 3/1-3 ;l=a1[ENTER]
- - will show you the titles of Diane's
- movies. (What this command means is:
- display search set #3,elements 1 through
- 3, but just the first line of each.)
-
- If you're not used to this set/element concept,
- it may seem strange. But after you've played around
- with the commands for a while, it'll become
- familiar. Remember, you can't hurt anything by
- putting in the "wrong" command.
-
- If you have problems with a particular command,
- pressing the [Esc] key will generally bring you back
- to the command menu.
- Getting Started with Indexer
-
- Indexer is an independent program that analyzes a
- collection of text files. The location(s) of each
- valid word in each file are assembled into a set of
- index files, for later use by the Finder lookup
- program. (These files are described in detail in
- the Files Appendix.)
-
- Indexer is controlled by means of the "profile"
- file, samples of which are included on your
- distribution diskette, as PROFILE.MOV, PROFILE.HLP,
- and PROFILE.SMP. (Your profile file can have any name.)
-
- Indexer is called from the DOS command line as:
- INDEXER profilename. E.g. indexer profile.mov
-
- All of the options used by Indexer are
- provided in the profile file. These options are
- described in the Indexer Profile section and in
- the supplied profile themselves, as comments.
-
- You can run Indexer against your own text files
- with the sample profile, PROFILE.SMP. A better way
- to go, though, is to spend some time studying the
- options so that you can create a profile that will
- maximize the usefulness of AnyWord. Indexer
- can take all night to run (on a 1M byte database),
- so give it some thought before starting it up.
-
- While Indexer is running it will show you some
- statistics. These are not necessary, but are
- presented for those of us who like to see what is
- going on during long programs. Basically, Indexer
- will make 1-2 passes of each of your text files for
- each of the characters that you've specified in the
- profile. (That's why it can take all night.) In its
- basic form, the numbers on the left of the screen
- show you memory usage while the numbers on the right
- half show you how much disk space is being used for
- the index files. (The keyword file contains an
- entry for each unique word in your database; the
- location file contains an entry for each occurrence
- of each word.)
-
- Here are the steps you will have to take to start
- indexing files, once you've installed the software,
- modified CONFIG.SYS, if necessary, etc. This is the
- minimum necessary: (As you get more familiar with
- the system, you'll be making additional changes)
-
- 1 - Starting with a copy of PROFILE.SMP
- and using any text editor that can operate
- with ASCII files and no tab characters,
- change the FILES parameter in the profile
- to specify the files you want indexed. (It
- currently refers to some sample files
- included on the diskette. If you wanted to
- index all of the files in a particular
- directory, you would use:
- FILES=\dirname\*.*
- If you wanted to index files called "abc"
- and "xyz", then:
- FILES=abc
- FILES=xyz
- If you have a floppy-based system and wish
- to specify a drive designation, then:
- FILES=B:\abc.ext
- FILES=B:\*.cor
-
- If you wanted all of the files on a
- particular diskette, then:
- FILES=b:\*.*
- (IMPORTANT: The backslash "\" MUST be
- specified after a drive letter.)
- If you have Wordstar files in "document"
- mode, use:
- WSFILES=filename
-
- 2 - Recommended for floppy disk systems:
- Set your DOS path to A & B. That is,
- enter: path A:\;B:\
- (Again, don't leave out the backslash.)
-
- 3 - Run Indexer, specifying the sample Profile
- file: indexer profile.smp
-
- 4 - Go out for lunch. (The program will
- take about 40 minutes to index 100K, about
- 8 hours to index 1M.) You'll know the
- program is running because status
- information on the screen will continually
- be updated.
-
- 5 - When you get back from lunch, you will
- find a set of files in your current
- directory -- unless you assigned them
- elsewhere with the PLOC/PKEY/PFENCE/PDLM
- parameters. (See below.)
-
- These files will have, in size, about a 1:1
- correspondence with your original text
- files. If this takes up more space than
- you have available on a disk, then you will
- have to juggle files. For instance, you
- can remove the program disk after the
- program has been loaded or you can use a
- RAM (virtual) disk or ....
-
- The biggest file is the location file, the
- keyword file may be 1/3 of that, the fence
- file, if the fencing (Record Option) was
- selected, is 1/4 of the location file size.
-
- The following Profile parameters can be
- specified to assign the output files to
- appropriate drives or subdirectories.
- For instance:
- PLOC=C:\ prefix for location file
- PKEY=C:\ prefix for keyword file
- PFENCE=C:\ prefix for fence file
- PTEMP=D:\ prefix for temporary work files
- PDLM=C:\ prefix for delimiter file
- Don't forget the backslash.
- System Flow
-
- AnyWord can be used in either of two ways:
- in the normal text lookup mode or in a special
- browse-only mode. (You would use the browse-only
- mode when the text files have not been indexed.)
-
- Two examples are shown. Both use sample files
- supplied on the program disk.
-
- This first example shows typical processing for
- doing lookups. The second is a scenario for
- browse-only processing.
-
- Look-Up Processing
-
- Input to Indexer:
- 1. Text Files
- lf
- lz
- ln
- 2. Stop list of words not to index
- exclude.mov
- 3. Profile File
- profile.mov
- - containing:
- SUFFIX=.mov
- FILES=??.
- INDEX=YES
- ...
- _______________________________
-
- C> Indexer profile.mov
- _______________________________
-
- Output from Indexer, input to Finder:
- f-files.mov
- f-ik.mov
- f-ix.mov
- f-tab.mov
- f-fence1.mov (optional)
- f-dlmloc.mov (optional)
-
- Also output from Indexer but not used by Finder:
- f-stats.mov
- And, the Help file:
- f-help
- _______________________________
-
- C> finder profile.mov
- _______________________________
-
- Output from Finder:
- - Any extract files that you specifically create
-
- Browse-Only Processing
-
- The alternate method of using AnyWord is to
- skip the indexing phase and just browse the files.
- (A simple character-by-character search is also
- available in this mode.)
-
-
- Input to Finder:
-
- 1. Text Files
- lf
- lz
- ln
-
- 2. Help file
- f-help
-
- 3. Profile File
- profile.mov
- - containing:
- SUFFIX=.mov
- INDEX=NO
- ...
-
- 4. List of files to browse
- f-files.mov
-
- _______________________________
-
- C> finder profile.mov
- _______________________________
-
- Output from Finder:
- - Any extract files that you specifically create.
- Record Option (Fencing)
-
- Normally, AnyWord uses byte (character)
- offsets within a file to keep track of where a word
- appears. Optionally, the system will also track
- what "record" a word is in.
-
- A record, as far as AnyWord is concerned, is
- anything between two markers (delimiters). These
- markers can be any character or sequence of
- characters. The Movie Database, for instance, uses
- two dots (..) to separate movie records. These two
- dots appear after every movie entry so as to
- separate the information for one movie from
- that for another movie.
- Plan 9 from Outer Space
- skdjk
- erioe
- ..
- Porky's
- sdkfj
- ekrer
- ..
-
- Actually, because every line in the Movie
- Database ends with a carriage-return/line-feed
- (CR/LF) combination, the actual sequence of
- characters between two movies is:
- [CR][LF][.][.][CR][LF]
- If you specify this sequence in the profile, then
- Indexer and Finder will always know where one record
- starts and the next begins. This also means that
- the system will know where the first line of a
- record is, where the second is, etc. This is used
- effectively in the Movie Database to find movie
- titles, because the title is always on the first
- line.
-
- The only requirements for a marker are that the
- character sequence be unique. If the Movie
- Database had used two dots on a line by themselves
- for anything else, then the system would have
- gotten confused.
-
- A blank line can be used. The actual sequence
- to specify, then, would be:
- [CR][LF][CR][LF]
- You could use something like that on a file like
- this, where the paragraphs are separated by blank
- lines. Each paragraph would then be a record.
-
- If your text files have two blank lines to
- separate sections, for instance, then this would be
- specified as:
- [CR][LF][CR][LF][CR][LF]
- because every line, even null (blank) lines have a
- non-printable CR/LF on the end.
-
- Another possibility is the page separator
- used by some word processing software. This
- might be the sequence
- .pa or
- .bp or perhaps
- the formfeed character CTL-L.
-
- The marker you want to use is specified in the
- profile file. See the Indexer Profile section for
- details.
- The Profile
-
- As with Finder, the profile file controls the
- way Indexer operates. Three sample profile files are
- included with AnyWord:
- profile.mov - for use with the Movie Database
- profile.hlp - for use with the Help file
- profile.smp - as a prototype for your own use
-
- Profile files can have any name. By convention,
- the ones supplied with AnyWord have the name
- "profile" with an extension designating the
- collection of files they pertain to. You can, of
- course, establish your own convention, such as
- calling the movie profile: movies.pro. Picking a
- convention and sticking to it will make life easier.
-
- The same profile file is used by both Indexer and
- Finder. That is, you set the profile file up before
- indexing your textbase and you specify the same file
- when you invoke Finder.
-
- The profile contains information that is used by
- Indexer only, other information used by Finder only,
- and still other information that is shared by both
- Indexer and Finder. You can change any of the
- Finder-only information before doing a Finder run,
- but you should not change any of the shared
- information between running Indexer and running
- Finder. That is, change the shared information
- only before running Indexer.
-
- The information in the profile is in the form of
- keyword parameters. Indexer and Finder read this
- file when they start up and pull out the information
- they need.
-
- The parameters are specified according to the
- following rules:
- 1. Each parameter occupies a line by itself and
- MAY NOT spill over onto another line.
- 2. Any line that starts with an * is a comment and
- is not processed by Indexer.
- 3. Any text preceded by a space will be treated
- as a comment.
- 4. Parameters may be supplied in any order.
- 5. Parameters must begin in column 1.
-
- The format is: PARAMETER=value
-
-
- FILES
- The FILES parameter is used to specify
- which text files are to be indexed. Any
- valid DOS name is acceptable, including
- wild card characters. For instance:
- FILES=\movies\??.
- would index all files in subdirectory
- "movies" that had a two-letter file name.
-
- There can be as many FILES statements as
- necessary, though no more than
- 254 individual files can be indexed in
- any one textbase.
- For instance:
- FILES=\autoexec.bat
- FILES=*.c
- FILES=a*.doc
- Also legal:
- WSFILES=xyz
- for files that have high-order bits
- turned on, e.g. Wordstar files in document
- mode.
-
- Note: If pathchecking is specified (see
- below) and the files parameter looked like:
- FILES=*.c
- then Indexer would expect to find all of
- the "*.c" data files in the same
- sub-directory.
-
-
- PATHCHECK
- If PATHCHECK=Y, the DOS PATH will be used
- to find files. That is, if a file is not
- found in the current or specified
- directory, it will be searched for in each
- of the directories specified in your system
- path. (See the DOS manual for more
- information about PATH.)
-
- XFILE
- Text file containing words that are not
- to be processed, e.g.,
- XFILE=exclude.mov
- This "stoplist" is an ordinary ASCII
- list of words, one word per line.
-
- SUFFIX
- All of the files created by Indexer will
- have a common file extension. For
- instance, if you specify
- SUFFIX=.mov
- then the location file will be called
- f-ix.mov, the keyword file will be
- called f-ik.mov, etc.
-
- FENCE
- If FENCE=1, then the Record Option will be
- used. If FENCE=0, then it will not. If
- the record option is used, then the
- MARKER parameter must be specified.
-
- MARKER
- This specifies the marker or delimiter used
- to separate (fence off) records when the
- Record Option is used. The number
- specified here is a menu selection number.
- That is,
- MARKER=1 means: a blank line between
- records
- MARKER=2 means: two blank lines
- between records
- MARKER=3 means: two dots on a line
- by themselves
- MARKER=4 means: .pa on a line by
- itself
- MARKER=5 means: .bp on a line by
- itself
- MARKER=9 means: the CHAR parameter
- is used to
- specify the
- characters
- making up the
- marker
-
- CHARn
- If MARKER=9 is specified, then CHAR is used
- to specify the exact characters, in hex,
- making up the marker. CHAR0 is the length
- of the marker, CHAR1, CHAR2, etc., is
- each character of the marker. For
- instance,
-
- MARKER=9 use CHAR parameters
- CHAR0=6 3 characters:
- CHAR1=0d CR
- CHAR2=0a LF
- CHAR3=0c a form feed (CTL-L)
-
- This would make a record out of everything
- from the beginning of a file up to a form
- feed, the next record out of everything
- from there to the next form feed, and so
- on.
-
- UPPER
- If UPPER=Y, then the case of the text is
- ignored. That is, the text will be
- internally converted to upper case
- characters and indexed that way. So, the
- words COMPUTER and computer would be mixed
- together and in Finder, a search would
- turn up both.
-
- PLOC
- PKEY
- PFENCE
- PTEMP
- PDLM
- Several files are created by Indexer.
- Normally, these files will be constructed
- in the current disk and directory.
- However, you can assign these files to
- different disks or different
- subdirectories. For instance, if you had a
- RAMDISK D:, you might want to specify
- PTEMP=D:\
- (The slash is important.) This would assign
- the temporary work files to your RAMDISK.
- Or perhaps you want to assign the location
- file to subdirectory 'ABC'. Then you would
- specify: PLOC=\ABC. Whatever you specify
- becomes a prefix for the file name. So,
- the full name of the location file in this
- example would be:
- \ABC\F-IX.LOC
- The movable files are:
- PLOC - location file
- PKEY - keyword file
- PFENCE - fence file
- PTEMP - temporary work files
- PDLM - delimiter file
-
- ACTION
- A particular action to be taken on each
- character. (Characters include letters,
- numbers, punctuation, special codes, etc.)
- The allowable specifications for a
- character are:
- I - ignore this character completely
- K - process this character as a
- unique value
- S - process this character as a space
- (blank)
-
- For instance, assume the following:
- 2D=I (hex for '-')
- 2F=S (hex for '/')
- 30=K (hex for '0')
-
- (A complete set of hex characters and
- typical K/S/I values are included in the
- sample profiles.) So, the string:
- 000/000-0000 would be indexed as: 000 and
- 0000000 because the '/' is treated as a
- space and would separate the string into
- two words. The '-' is ignored, i.e.,
- treated as not there, so the 000-0000 is
- run together. Let's say, instead, that:
- 2F=K Now, Indexer will index 000/0000000 as
- one string, because the '/' is treated as a
- normal character.
- Interrupting Indexer
-
- Indexing a collection of files can take quite a
- while. And, in this version, there is no way to
- terminate Indexer and have it later pick up from
- where it left off. However, you can temporarily
- pause it, do some minor operation, such as a DIR
- command, and then let Indexer continue.
-
- By pressing [Esc] when Indexer is running, the
- program will pause and ask you what to do. You then
- have three choices:
- I - do nothing (return to Indexer)
- D - temporarily jump out to DOS
- Q - quit
-
-
- [I]
- Selecting the [I] option simply allows
- Indexer to continue running.
-
-
- [D]
- If you enter a [D], then DOS will get control.
- However, Indexer will remain in your PC memory and
- will be able to start back up when you type in the
- word "exit" on the DOS command line. This is
- explained (somewhat) in your DOS manual, under the
- COMMAND command. (In the DOS 3.2 manual, it's
- p.7-53.)
-
- What is happening here is that a second copy of
- the DOS command processor has been loaded.
- Naturally, with this second copy, plus Indexer, in
- your system, the amount of memory you'll have
- available will be reduced. (By roughly 150K, in
- this version.) But this should still give you
- enough to use for many programs and all DOS
-
- commands. The big benefit here is that when you
- type in [exit], the secondary copy of DOS will go
- away and you'll be returned back to the point in
- Indexer where you left off.
-
- NOTE: You may interrupt Indexer as frequently as
- you desire. Indexer remains in a suspended
- state until allowed to continue.
-
-
- [Q]
- Pressing [Q] will permanently terminate this
- Indexer run. All results will be discarded.
- FInder Command List
-
- All commands are entered by typing in the
- first letter of the command whenever the menu
- is on the screen.
-
- Help
- - Displays portions of this manual
-
- Search
- - Find an indexed word (or words)
- among the text files
-
- Display
- - View the results of a search
-
- Call
- - Call a DOS command or other program
-
- Interrupt
- - Temporarily exit to DOS.
- (Return is by typing EXIT)
-
- Review
- - Display past search results
- (for this Finder session)
-
- View
- - Display all indexed words
-
- Kill
- - Free up memory for additional searches
-
- Where
- - Display the locations where the
- search terms were found
-
- Find
- - Alternate search method: simple
- string matching
-
- Mode
- - Change the Finder environment;
- i.e. change how Finder works
-
- Quit
- - Terminate Finder
-
-
-
- The following keys also have special meaning:
-
- PgDn
- - Display the next text segment (the one after
- the one that was most recently displayed)
-
- PgUp
- - Display the previous text segment (the one
- that immediately precedes the most recently
- displayed text page)
-
- Home
- - Display the first text segment in the database
-
- End
- - Display the last text segment in the database
-
- Ctrl/Home
- - Display the first text segment in
- the current file
-
- Ctrl/End
- - Display the last text segment in
- the current file
-
- Ctrl/PgUp
- - Display the first segment of the
- previous text file
-
- Ctrl/PgDn
- - Display the first segment of the
- next text file
-
- Note: The size of the segment to be displayed
- can be set via the profile file or the mode command.
- Remember that the number of bytes specified is HALF
- of what will be seen on the screen.
- Record Option (Fencing)
-
- Normally, AnyWord uses byte (character)
- offsets within a file to keep track of where a word
- appears. Optionally, the system will also track
- what "record" a word is in.
-
- A record, as far as AnyWord is concerned, is
- anything between two markers (delimiters). These
- markers can be any character or sequence of
- characters. The Movie Database, for instance, uses
- two dots (..) to separate movie records. These two
- dots appear after every movie entry so as to
- separate the information for one movie from
- that for another movie.
- Plan 9 from Outer Space
- skdjk
- erioe
- ..
- Porky's
- sdkfj
- ekrer
- ..
-
- Actually, because every line in the Movie
- Database ends with a carriage-return/line-feed
- (CR/LF) combination, the actual sequence of
- characters between two movies is:
- [CR][LF][.][.][CR][LF]
- If you specify this sequence in the profile, then
- Indexer and Finder will always know where one record
- starts and the next begins. This also means that
- the system will know where the first line of a
- record is, where the second is, etc. This is used
- effectively in the Movie Database to find movie
- titles, because the title is always on the first
- line.
-
- The only requirements for a marker are that the
- character sequence be unique. If the Movie
- Database had used two dots on a line by themselves
- for anything else, then the system would have
- gotten confused.
-
- A blank line can be used. The actual sequence
- to specify, then, would be:
- [CR][LF][CR][LF]
- You could use something like that on a file like
- this manual, where the paragraphs are separated by
- blank lines. Each paragraph would then be a record.
-
- If your text files have two blank lines to
- separate sections, for instance, then this would be
- specified as:
- [CR][LF][CR][LF][CR][LF]
- because every line, even null (blank) lines have a
- non-printable CR/LF on the end.
-
- Another possibility is the page separator
- used by some word processing packages. This
- might be, for instance, the sequence:
- .pa or
- .bp or perhaps
- the formfeed character CTL-L.
- The Finder Profile
-
- As with Indexer, the profile file controls the
- way Finder operates. Three of these profile files
- are included with AnyWord:
- profile.mov - for use with the Movie Database
- profile.hlp - for use with the Help file
- profile.smp - as a prototype for your own use
-
- Profile files can have any name. By convention,
- the ones supplied with AnyWord have the name
- "profile" with an extension designating the
- collection of files they pertain to. You can, of
- course, establish your own convention, such as
- calling the movie profile: movies.pro. Picking a
- convention and sticking to it will make life easier.
-
- The same profile file is used by both Indexer and
- Finder. That is, you set the profile file up before
- indexing your textbase and you specify the same file
- when you invoke Finder.
-
- The profile contains information that is used by
- Indexer only, other information used by Finder only,
- and still other information that is shared by both
- Indexer and Finder. You can change any of the
- Finder-only information before doing a Finder run,
- but you should not change any of the shared
- information between running Indexer and running
- Finder. That is, change the shared information
- only before running Indexer.
-
- The information in the profile is in the form of
- keyword parameters. Indexer and Finder read this
- file when they start up and pull out the information
- they need.
-
- The parameters are specified according to the
- following rules:
- 1. Each parameter occupies a line by itself and
- MAY NOT spill over onto another line.
- 2. Any line that starts with an * is a comment and
- is not processed by Indexer.
- 3. Any text preceded by a space will be treated
- as a comment.
- 4. Parameters may be supplied in any order.
- 5. Parameters must begin in column 1.
-
- The format is: PARAMETER=value
-
- Many of the Finder parameters can be changed
- while running Finder, by using the [M]ode command.
- Those parameters will only be briefly mentioned here
- with more detail in the Changing the Environment
- section.
-
- DRANGE
- - Display range: amount of text to show.
- Ex.: DRANGE=8L
-
- LIMIT
- - Number of characters between search terms.
- Ex.: LIMIT=20
-
- OR
- - Character to be used for OR searches.
- Ex.: OR=|
-
- AND
- - Character to be used for AND searches.
- Ex.: AND=&
-
- NOT
- - Character to be used for NOT searches.
- Ex.: NOT=!
-
- OPTIONS
- - Character on a command line that
- denotes optional information follows.
- Ex.: OPTIONS=;
-
- RESULT
- - Character that indicates the following
- number is the number of a search set.
- Ex.: RESULT=-
-
- WILD
- - The character that will match any text
- character(s).
- Ex.: WILD=?
-
- WORDS
- - If Y, indicates that the words AND, OR, NOT
- will be reserved for internal use and will
- not be searchable.
- Ex.: WORDS=Y
-
- EXTRFILE
- - File into which extracted text will be
- placed.
- Ex.: EXTRFILE=subfile.mov
-
- EXTRTYPE
- - Indicates on what basis text extraction will
- be done.
- Ex.: EXTRTYPE=N
-
- INDEX
- - If YES, then Indexer has been (or will
- be) run on the files specified in the FILES
- parameter. This will create the index
- files that Finder uses for looking up words.
-
- If NO, then Finder is to be used on
- files that have not been indexed. The
- FILES parameter, which is used only by
- Indexer, will of course be ignored.
- Instead, a list of files (with no wild
- cards) must be put into a file called
- F-FILES.ext (where ext is specified by
- SUFFIX=). Note: Without indexing,
- Search, Display and some other
- commands will not work. However, you
- can still browse the files and use other
- commands such as [F]ind.
-
- Function Keys F1 through F10
- - Assign a character string or special
- operation to a function key.
- Ex.: F3=/1-10 ;l=a1
-
- PATHCHECK
- - If Y, the DOS PATH will be used to find files.
- Ex.: PATHCHECK=Y
-
- SUFFIX
- - Filename extension used in this textbase
- for files created by Indexer.
- Ex.: SUFFIX=.mov
-
- FENCE
- - If FENCE=1, then the record option will be
- used, if FENCE=0, then it will not. If the
- record option is used, then the MARKER
- parameter must be specified.
-
- MARKER
- - If the Record (fencing) Option is used,
- this specifies what characters are used to
- separate (delimit) records, i.e., fence off
- each segment of text from other segments.
- See the Indexer Profile section for more
- information.
-
- CHAR
- - The specification of the delimiter for the
- Record Option. See the Indexer Profile
- section for more information.
-
- UPPER
- - If all searches are to ignore the case
- of the text, then specify UPPER=Y, otherwise if
- lowercase characters are to be treated
- differently than uppercase characters, specify
- UPPER=N.
-
- XFILE
- - The name of the file containing the list of
- words that have not been indexed.
- Example: XFILE=exclude.hlp
- Changing the Environment
-
- You can easily control many of the things that
- AnyWord does. The primary way to do this is
- through the profile file, discussed elsewhere.
- Another way, which you can do after the program has
- begun running, is via the MODE command. The MODE
- command allows run-time access to things that would
- normally be specified in the profile. This lets you
- change things on the fly as well as allowing you to
- make temporary changes to the way AnyWord runs
- without affecting future runs.
-
- The parameters that can be changed with the
- [M]ode command are:
-
- Display range
- The number of characters of text to display
- on each side of a search term when showing
- the results of a search. If followed by an
- L, then it is the number of lines. Zero
- (0) means display an entire delimited
- (fenced) record instead of a character
- count.
- Examples: 100 - 100 characters
- 8L - 8 lines
-
- Limit
- The number of characters that search terms
- connected by an operand (AND, OR, NOT)
- must be within. A zero means that the
- search terms must simply be within the same
- record.
-
- Example 1: With limit at 20, a search for:
- Universit? and Illinois
- would get:
- University of Illinois
- (The beginning of the word University is
- within 20 characters of the word Illinois.)
- It would also retrieve:
- universities in Illinois
- but would not retrieve:
- "universities and other institutions
- in Illinois"
-
- Example 2: The reverse situation
- would hold for:
- Universit? not Illinois
- This would not retrieve "Unversity of
- Illinois" but would retrieve "universities
- and other institutions in Illinois".
-
- Fudge
- The number of characters that search words
- within one term can be within.
-
- Example: search for University of
- Illinois with Fudge set at 15. Assuming
- "of" is not a searchable word (i.e., it was
- specified in the exclude list), then the
- words University and Illinois must be
- within 15 characters of each other. The
- results would then be the same as for a
- Limit set to 10, explained previously.
-
- Marker
- The delimiter used for the Record
- Option ("fencing"). You will be prompted
- to enter a character string.
-
- Pathcheck
- If Yes, then the subdirectories in the DOS
- PATH will be searched (if necessary) to
- find files.
-
- And character
- The character, normally "&", to use for
- AND.
-
- Or character
- The character, normally "|", to use for OR.
-
- Not character
- The character, normally "!", to use for
- NOT.
-
- Set character
- The character, normally "-", to use to
- refer to a previous search set
-
- Wild character
- The character, normally "?", to cause
- Finder not to match characters in the
- search argument past this point
-
- Option marker
- The character, normally ";", to tell Finder
- that a special option follows on the
- command line
-
- Reserved words
- The words AND, OR, NOT will be used in
- addition to the special characters defined
- for these functions.
-
- Uppercase
- Yes, if all searches are to ignore the case
- of the text; No, if lowercase characters
- are to be treated differently than
- uppercase characters
-
- Type of Extract
- If automatic, extracts all displayed text.
- If semi-automatic, asks before extracting.
- If manual, allows comments with each
- extraction. Or none. (With "none"
- specified, Extract will only be done with
- the ;E option or [E]xtract command.)
-
- Com file drive
- The drive designation, e.g. A, C, etc.,
- where your DOS COMMAND.COM file is located.
- (If not changed, will be the one specified
- in the DOS COMSPEC environment variable.)
-
- Extract file
- File to be used for extracted text, unless
- over-ridden by the [E]xtract or [D]isplay
- commands
-
- Index
- If No, then the filenames in f-files.xxx
- have not been indexed and only the
- browse/find commands will be operative on
- them.
- Searching
-
- The [S] option off the main menu is used to do
- text lookups. Once you've pressed the "S", Finder
- will prompt you for a search string and allowable
- options.
-
- Finder uses the "set" concept of search and
- retrieval. That is, when you do a search, Finder
- keeps a list of the places in the text where your
- search word or phrase was located. This list is
- kept until you decide, with the [K]ill command, to
- throw it away.
-
- For instance, let's say your first search was for
- the word 'future' and that it appears 5 times in
- your text files. All 5 occurrences are called a
- "set". Let's say further that the first time it
- appeared was in file abc, the second time in file
- def, the third time in file ghi, and so on. If you
- just wanted to look at the word "future" in file
- ghi, then what you would want would be the third
- "element" in the first search set. When you wanted
- to display the text, you would specify this
- particular appearance of 'future' as '1/3' -- set 1,
- element 3.
-
- Let's say your second search would be for the
- word 'past' and there are 9 occurrences of this word
- in your files. These words could be specified as
- 2/1 through 2/9.
-
- Of course, the results for search set 1 are still
- around and can still be retrieved or re-used. With
- this method, each place your requested word(s) have
- been found will have a unique designation.
-
- When would you want to go back to the results of
- a particular search? If you wanted to look at the
- results again, perhaps. Or if you wanted to narrow
- down a search. Suppose for instance, that a search
- for the word "future" reported 200 occurrences. You
- probably don't want to read all 200 text segments,
- particularly if you're only looking for one in
- particular.
-
- Maybe you're only interested in those text
- segments where the word "future" also appears with
- the word past. You could do a search for:
- future and past
- Or, you could do:
- -1 and past
- What this means is: use the results of search 1
- as part of a new search. This was a trivial
- example, but you might see how this can be quite
- powerful. Because AnyWord "remembers" as many
- as 100 search requests, some complex searches can be
- done. For instance:
-
- Set # Terms Results
- 1. future not time 200 occurrences
- 2. -1 and past 11
- 3. -2 or Dickens 14
- 4 Christmas 6
- 5. -3 and -4 2
-
- Just as an aside, for those who have used
- a mainframe
- search service, this is the same as:
- (((future not time) and past) or Dickens)
- and Christmas
- With that method, it's easy to make errors with
- the parentheses and get an erroneous result -- which
- may not be spotted right away. The method Finder
- uses, which is modeled after Lockheed's Knowledge
- Index, is more flexible and less error-prone.
-
- Multi-word searches are also possible. For
- instance, using the Movie Database for our examples,
- 'Louis Gossett, Jr.' can be searched for. Or Edgar
- G. Robinson. 'Edgar Robinson' would also produce
- the same results, because Finder will match all the
- Edgars with all the Robinsons, rather than doing a
- match for a single piece (string) of text.
-
- Actually, Finder would have reported a match if
- it found the string: Robinson, Edgar G. -- because
- the order of the search words doesn't matter to
- Finder. What does matter is the "fudge factor", set
- in the Mode command, which can be used to control
- how far afield Finder will go in trying to produce a
- match.
-
- A single "wild-card" character is permissible per
- search term. (A "term" would be 'future' or 'Edgar
- G. Robinson' in the above examples. 'Debra Winger
- & Nick Nolte' would be 2 search terms. Note that
- '&' is equivalent to 'and'.) The wild-card
- character, denoted by a '?', allows a match for
- anything. For instance, John? would produce
- matches for John, Johnson, Johnston, etc.
-
- You can specify the number of bytes (characters)
- that search terms can be separated by. This can be
- specified in the Environment for an entire session
- or on the search statement as ;R=nnn, For instance:
- winger & nolte ;R=50
- would tell Finder to mark only text sections
- in which these terms were separated by 50 bytes or
- less.
-
- Searches can be restricted to particular lines
- within records, if the Record (fencing) Option had
- been used when the files were run through Indexer.
- A semi-structured collection of text files, like The
- Movie Database, might have particular information in
- the first line of an entry (record), such as a
- title. You can therefore restrict your search that
- way. For instance, a search for:
- York ;L=A1
- would return the movie entry with the title
- "Sergeant York", but not any movies with Susannah
- York.
-
- The semi-colon (;) above indicates that an option
- follows. (You can change all of the special
- characters if you desire.) The 'L' means a
- particular line is requested. The 'A' is always
- present in this version of Finder. The '1' means
- that you want the search restricted to the first
- line of a record.
-
- Another line option is ;L=S. For instance,
- D & Woody Allen ;l=s
- would mean that you wanted to select all movie
- records with Woody Allen listed as the director.
- Search Examples
-
- The following searches, using The Movie Database,
- are marked by [OK] if legal, [BAD] if not legal.
-
- 1. Debra Winger [OK]
-
- 2. winger and nolte [OK]
-
- 3. schwarz? [OK]
- (Find all keywords starting with SCHWARZ)
-
- 4. arnold schwarz? [BAD]
- (Can't have multiple words & a wild card in
- the same search term)
-
- 5. arnold and schwarz? ;r=7 [OK]
- (An alternative to #4; the r=7 means
- the two terms must be within 7
- characters of each other.)
-
- 6. woody allen & diane keaton [OK] ("&" is "and")
-
- 7. woody allen | diane keaton [OK] ("|" is "or")
-
- 8. woody allen ! diane keaton [OK] ("!" is "not")
-
- 9. allen & keaton & hemingway [BAD]
- (Can't have more than 2 search terms/line)
-
- 10. toshiro mifune [OK]
-
- 11. -9 & Akira Kurosawa [OK]
- (Use the result of a previous
- search, search set #9)
-
- 12. -11 & 1981 [OK]
- (Narrow a search down further)
-
- 13. james ;l=a1 [OK]
- (To find the name James only on the
- first line of a record, which, in the
- Movie Database, is the title line)
-
- 14. time & time ;l=s [OK]
- (To find a movie with the word "time"
- repeated twice on the same line).
-
- 15. Robert Redford ;l=a1 [BAD]
- (Can't do multiword search & line
- option together)
-
- 16. james not -13 [OK]
- (To find james not on a title line)
- Displaying Text
-
- Once text has been located with the [S]earch
- command, you can use the [D]isplay command to view
- it. (See the "Browsing" section for additional
- commands.)
-
- The [D]isplay command works on the same
- set/element concept that [S]earch does. That is,
- using the Movie Database for an example, if
- search set #1 reported 6 movies with Debra Winger,
- you can view each separately by doing a display for
- 1/1, 1/2, 1/3, etc. You can also display them
- continuously, e.g. 1/1-4.
-
- Another option lets you display only particular
- lines of each record, e.g., 1/1-6 ;L=A1. (The L
- means line, the A means the first delimiter type,
- always an "A" in this version of Finder, and the 1
- means the first line of the specified records, which
- for The Movie Database, would be the title of the
- movie. Please note that your files must have been
- indexed with the Record Option in order to select
- particular lines of particular records.)
-
- Using the [M]ode command lets you change the
- amount of text that you can display at one time.
- (You can also specify this in the profile file.)
- Text display can be by record, by a character (byte)
- count or by a line/paragraph count -- actually a
- count of carriage returns. For The Movie Database,
- the delimiter display method is the default.
-
- Note: if you set a value too low, you'll find
- that the browsing functions will seem to be stuck.
- In this case, just reset the display range to a
- larger number. Conversely, if the displayed text
- consistently spills over to another screen, set the
- range to a lower number.
-
- Displayed text can also be copied (extracted)
- into other files. This can be done automatically or
- individually. See the "Extracting Text" section and
- the "Display Examples" section for further
- information.
- Display Examples
-
- The following examples show you how to
- display the results of particular searches. First,
- press [D] from the main menu, then you will be
- prompted for a set/element number. Enter the
- information and press [ENTER].
-
- 1.
- (A null line, i.e., a RETURN/ENTER by itself.)
- Display the first or next element of the last
- (current) search set.
-
-
- 2. 3
- Display the first or next element of set #3
-
-
- 3. 4/2-5
- Display the 2nd through 5th occurrences of
- the search terms specified in set 4.
-
-
- 4. 2/1-9 ;l=a1
- Display the first line of each of elements
- 1-9 in set 2.
- (Assumes Record Option in use)
-
-
- 5. 10/3 ;l=a2
- Display only the 2nd line of the third
- element in set 10.
- (Assumes Record Option in use)
-
-
-
- 6. 1/1-999
- Display the full range of occurrences for
- set #1. (If there aren't 999 elements
- in set 1, then Finder will come back with
- the correct number.)
-
-
- 7. 13/6 ;E=M
- Extract this text into a file and prompt
- for a file name and additional comments.
-
-
- 8. 13/1-7 ;E=S
- Using a pre-selected file name and
- prompting you only for a yes/no, extract
- the text associated with elements numbered
- 1 through 7.
-
-
- 9. 15/10-30 ;E=A
- Without asking, extract all displayed text
- as it is put to the screen.
-
-
-
- Some illegal display responses:
- 0/1-999 (Set numbers start with 1)
- 1/0-999 (Element numbers start with 1)
- 5/1-8 ;E=A ;L=A1 (This version of Finder allows
- only one option per display
- command; if you need to set
- both, use the [M]ode command.)
- Identifying Located Files
-
- After you've done a search, you may want to see
- where the search term[s] were found. For instance,
- if a search for the word 'future' reports 100
- occurrences in the database -- too many to look at
- -- you may want to see which files the word was
- found in.
-
- The [W]here command produces output that
- looks like:
-
- 4/1: at 703 (hex 2BF) within file OZ
- 4/2: at 2001 (hex 7D1) within file TF
- ...
-
-
- Here's what this means:
-
- 4 - The number of the search set.
- 1 - An element within the set. There will be one
- line of output for each element (occurrence
- of the search terms) within the set.
- 703 - The number of characters (offset) from the
- start of the file where the term(s) were
- found.
- 2BF - The offset, expressed in hexadecimal
- notation. (This can be useful with other
- programs that let you access parts
- of a file by location.)
- OZ - The name of the file in which the search
- term(s) were found.
-
- Similarly, the second line of the output shows
- you the second occurrence of the search terms.
-
- When you press [W] on the main menu, you will
- be prompted for the number of a search set to look
- at. You may also specify a particular element or
- range of elements, such as:
- 2/1-10
- 3/4
- (This is similar to what you'd enter for the
- [D]isplay command.)
- Printing
-
- There are three ways to get a printout of what
- Finder displays on the screen.
- a) Shift/PrtSc
- b) CTL/P or CTL/PrtSc
- c) Re-directing I/O
-
- At any time in the course of running Finder
- you can press the Shift/PrtSc keys and send the
- contents of the screen to your printer.
-
- If you want to run your printer simultaneously
- with the screen display, then "toggle" the CTL/P or
- CTL/PrtSc keys. This is a little-known but very
- useful feature of PC-DOS. For instance, if you want
- to print some text that your search has found, press
- the CTL/P keys before the text starts to display on
- your screen. After the text has been displayed, you
- can press those same keys again to turn the printer
- off. Of course, unless you have an extremely fast
- printer, this will slow down the rate at which
- output will be displayed on your screen.
- Nonetheless, this method can provide a useful log of
- your session.
-
- The third method is to redirect normal Finder
- output to the printer when starting up the program.
- Through DOS redirection, all text that Finder
- locates can be automatically printed or saved into a
- file or sent to another device, e.g., a
- communications port. This is done by starting
- Finder with:
- finder profile.mov > LPT1:
- or finder profile.mov > out.mov
- Extracting Text
-
- Anything in your text files that can be displayed
- can be copied into new file(s). For instance, let's
- say that, using The Movie Database, you have done a
- search for Edward Woodward and then displayed the
- entry for "Breaker Morant". You have also
- specified, in your profile or with the [M]ode
- command, that text is to be sent to file AMOV.TXT.
- Now, if you select [E]xtract from the menu, the
- information on "Breaker Morant" will be added to
- file AMOV.TXT.
-
- You can use the same output file for each
- extraction, or you can change output files when the
- [E]xtract command asks you for a file name.
-
- You can also precede each data extraction with
- comments or other information typed in from the
- keyboard. For instance, you could have typed in:
-
- ===> Australian movie about a Boer War
- incident.[ENTER]
- cf. "Gallipoli". [ENTER]
- [ENTER]
-
- and this would go into the output file, followed by
- the entry on "Breaker Morant".
-
- Extractions can also be done via the [D]isplay
- command. Within the profile, or using the [M]ode
- command or in the [D]isplay command itself, you can
- specify: "manual", "semi-automatic" or "automatic"
- extraction -- as well as "none".
-
- "Manual" means that Finder will ask you if you
- want to extract the text every time a search element
- has been displayed. It will also ask you to enter
- comments and a file name. In semi-automatic mode,
- Finder will only ask you if you want to perform the
- extraction on a particular display or not; if yes,
- it will be written into the same extract file as was
- the previous. In automatic mode, Finder doesn't
- even ask -- it just copies the text into the
- pre-selected file.
-
- Note: Specifying a device, like LPT1:, instead
- of a file name is perfectly legal, and would send
- the extracted text to that device. (LPT1: is the
- printer connected to parallel port 1.)
- Viewing Searchable Keywords
-
- The [V]iew command lets you look at all of the
- words in the database that have been indexed. You
- can start at any point and continue for as long as
- you like. The number of times an indexed word
- occurs in the database is displayed next to that
- word.
-
- For instance, to see all the words starting with
- 'b', just enter: b[RETURN] when asked for a
- starting point. Press Esc when ready to stop.
-
- To see all of the words in the index, enter a
- null line when asked for a starting point. (A null
- line is just the RETURN/ENTER key by itself, with
- nothing proceeding it.
-
- Like the output of the Display command, the
- output of View can be redirected to a file or to
- another device, such as a printer. For instance, if
- you start Finder with:
- finder profile.mov > movielst
- and then, when the menu comes up, press V, the list
- of words will be written out to a file called
- "movielst" (instead of being written to the screen.)
- You will still get the "Press Esc or any other key
- to continue" message. Continue pressing a key
- whenever this message comes up. When the
- complete list has been written out, the main menu
- will return to the screen.
-
- Alternatively, like the output of the Display
- command, you can press CTL-P or CTL-PrtSc at any
- time to send the results of a View command to the
- printer, in addition to seeing it on the screen.
- (Press CTL-P or CTL-PrtSc again to turn off the
- output to the printer.)
- Reviewing Previous Searches
-
- All of the searches in a Finder session are kept
- in memory. You can look at the results of these
- searches with the [R]eview command.
-
- Complicated searches can be built using the
- results of previous searches. See the "Search"
- section for further information.
-
- The actual output of the [R]eview command might
- look like: (Using the Movie Database)
-
- Set 1
- 6 occurrences for Debra Winger
- 13 occurrences for Nick Nolte
- 1 occurrence for Debra Winger & Nick Nolte
-
- Set 2 (Error)
- 0 occurrences for L?
-
- set 3 (Interrupted)
- 100 occurrences for L?
- 200 occurrences for B?
- 0 occurrences for L? | B?
-
- set 4
- 1 occurrence for -1
- 2 occurrences for John Steinbeck
- 1 occurrence for -1 & John Steinbeck
-
- The search terms and results for each set are
- listed. If processing of a search did not go to
- completion, then the set will be marked as being in
- Error or having been Interrupted.
-
- The [R]eview command will prompt, like the
- [D]isplay command, for the number of the set you
- want to look at. It will also accept a range, such
- as 1-10, or the letter 'a' (for all sets).
- Browsing Through Text
-
- Your "database" consists of a collection of
- files. Finder understands that and provides a
- flexible means of browsing these files. From any
- starting place, you can navigate in any direction
- within the database. Your starting place is
- determined by the last [D]isplay command you've done
- or, if no [D]isplay was done, then the beginning of
- the first file. You can use the following keys to
- navigate:
-
- PgUp - previous record or text block
- PgDn - next record or text block
- Home - beginning of the database
- End - end of the database
- Ctrl/Home - beginning of the current file
- Ctrl/PgUp - beginning of the previous file
- Ctrl/PgDn - beginning of the next file
- Ctrl/End - end of the current file
-
- The amount of text displayed on the screen, the
- text block, depends on what has been set in the
- environment, i.e., with the profile file or the mode
- command.
-
- For instance, if the display range has been set
- to 50, then 100 characters (approximately) will be
- displayed. If the range has been set to 0, then the
- display will be one "record", e.g. all of the text
- between two delimiters. You may find it more
- convenient to specify a line count: 9L would show
- 18 lines of text on the screen. (Note: the number
- specified in the range is the number of characters
- or lines BEFORE the midpoint and a similar number
- after. This was done to be compatible with the
- [D]isplay command.
-
- It is possible to browse files other than ones
- that have been indexed. If you have run Indexer, a
- file was created called F-FILES.ext, where "ext" is
- a unique extension. If you edit F-FILES.ext and add
- additional files at the end, you will be able to
- browse them. Of course, none of the commands that
- rely on indexing -- e.g. search, display -- will
- function will these added files.
-
- It is also possible to create an entire
- collection of files, a "textbase" without indexing
- it. First, create a F-FILES.ext, where "ext" is a
- unique extension. Place into this file the names of
- your unindexed text files (no wild cards). Create a
- profile file specifying:
- SUFFIX=ext and
- INDEX=NO
- in addition to the usual parameters. Now you can go
- into Finder specifying this profile and be able to
- perform browsing operations, as well be able to use
- the [F]ind command.
- When Text Files Change
-
- Finder works best when the text files are static,
- at least between Indexer runs. But all is not lost
- when you change your files without having a chance
- to run Indexer on them again.
-
- Of course, if you add new text into a new file
- then AnyWord's indexing won't be disturbed.
- The second best thing is to add new text at the end
- of existing indexed files. This will not change the
- locations of already indexed keywords. (Of course,
- in both cases, the new material won't be indexed.)
-
- New text that is substituted for old on a
- character-for-character basis won't affect the
- majority of your text, but the indexes will still
- point to the old, deleted material. The worst
- thing, as far as maintaining usable indexes goes, is
- to delete or insert material at the beginning of
- files. This will throw the indexes to everything
- else in that file off by the number of characters
- you've inserted or deleted. Still, you can use
- Finder to go to the approximate spot where it thinks
- it located a search string and then browse
- or use [F]ind to locate its exact position.
-
- Another solution, and one that I've used in a
- crunch, is to save the original text files in a
- separate subdirectory or on separate floppies. Use
- this set of text files with Finder, noting the
- filenames and offsets of the found search terms.
- Then use Finder (or other browse/edit program)
- against the newest files, using these offsets. This
- allows you to narrow down a search pretty well
- before locating the keyword in the new files. You
- can combine this approach with the technique
- mentioned in the "Browse" section -- the names of
- any unindexed file can be added to the end of the
- f-files file.
- Alternate Search Method
-
- O.K., you know something is in there somewhere,
- but the Search command doesn't find it. Maybe it
- was excluded during the indexing process. Perhaps
- it's been added to the database after it was
- indexed. Perhaps you want to find something that's
- not a complete word, but just a string of characters
- that appear in the text. Or maybe you just don't
- believe the Search command works correctly. (Shame
- on you!)
-
- Well, wonder no more. The [F]ind command
- will accept your text string and try to match it
- against the text in the database. It will take
- whatever you type in and, starting with wherever you
- currently are in the database, will look for exactly
- that text. For instance:
-
- F The Who will do a match for The[b]Who
- (where [b] is a blank)
- F ... will do a match for three dots
- (an ellipsis)
- F porting will find references to importing,
- exporting, etc.
-
- Each line or paragraph in the database is
- individually searched. Please note, because
- searching is on a line by line basis, that the
- search phrase can not span a [CR][LF]
- (carriage-return/line-feed) boundary, even if CRs
- and LFs are marked for keyword processing in the
- profile. (By "line" or "paragraph", we mean
- anything delimited by CR/LFs.
-
- When [F]ind gets a match, it will display the
- text surrounding the located string. (The amount
- that is displayed depends on the environment; i.e.
- the DRANGE parameter set in the profile file or the
- [M]ode command.) Having displayed the first
- occurrence of the text, [F]ind will ask you whether
- you want to go on.
-
- Note that this is a "brute force" character-by-
- character search and that there is no way of
- knowing how many occurrences of the search argument
- you will find within the database. You also need to
- give it time to run, as this type of search can be
- slow.
-
- Note also that if the current environment
- specifies "uppercase", then case will be ignored
- during the matching process.
- Help
-
- Selecting 'H' from the menu loads the F-HELP file
- and puts up a list of topics on the screen. Select
- a topic by striking its designated letter or number.
- Use the Esc key to exit out of a particular topic
- and put the topic list back up on the screen.
- Strike Esc again, and you will go back to the main
- menu.
-
- You can put your own information into Help, giving
- you a useful means of recalling notes. For instance,
- you might want to put information about a specific
- database into the help file.
-
- Because the Help file is a simple ASCII (text)
- file, you can easily modify it using any text editor
- that can read/write ASCII files and which will put a
- "hard" carriage return at the end of each line.
- This includes Edlin, PE, Edix, Wordstar
- (non-document mode), PMATE, Brief, etc.
-
- All headings that show up on the Help screen are
- taken from lines in this file with a form feed
- (CTL-L) in the first column. So, new headings can
- easily be added and old ones changed or removed.
-
- Finder does not care what the order of the topics
- is, so your only constraint is what will fit on the
- Help screen. Topic titles, for instance, can only
- be 30 characters long and there can be no more than
- 34 topics (2 columns of 17) total. Reminder:
- don't forget to take the border into account when
- adding text: restrict each line to 72 characters.
- Freeing Memory
-
- As you do each search, Finder internally records
- all of the text locations that match your search.
- This location information is kept in memory for
- potential use in later searches.
-
- If you reach a point where a search (or other
- operation) fails because of a lack of memory, you
- will probably want to get rid of any old search
- results that you don't need. Use the [K]ill command
- to do this.
-
- By "killing" a search set, the memory devoted to
- that set can be re-used.
-
- When prompted for a number, enter the number
- of a search set that you don't need. (In order to
- free up enough memory for particular operations, you
- may need to use the [K]ill command several times.)
-
- Use the [R]eview command to look at your previous
- searches to see which ones you don't need to keep
- around.
- Interrupting Finder
-
- Normal Finder processing can be interrupted in
- two ways. The CALL command allows you to run a
- program of your own choosing. The INTERRUPT command
- allows you to temporarily go to DOS, execute
- whatever commands and programs you want to, and then
- return to Finder with all of your previous search
- results still intact. Note: either option requires
- you to have enough memory space in your machine to
- load the PC-DOS command processor (COMMAND.COM) or
- its equivalent.
-
- The MODE command, with the 'C' option, allows you
- to tell Finder that COMMAND.COM is on a different
- drive from the one you booted with. However, DOS
- will still want to use the original COMMAND.COM, so
- you may find, after using the INTERRUPT command,
- that the new DOS session won't work quite the way
- you expect. Put another way: if you don't need to
- change the drive, don't.
-
- Two things you might want to interrupt
- Finder to do are:
- - look at a file with the DOS TYPE command
- - run a second copy of Finder with a different
- textbase.
- Calling Other Programs
-
- The [C]all command allows you to enter the name
- of a program or DOS command, just as you would type
- it in on the DOS command line. This can be any
- program. The only restriction is memory. You will
- not have as much memory available as you would if
- Finder were not running. As with the [I]nterrupt
- command, Finder stays in memory. Unlike [I],
- however, a second copy of the DOS command processor
- is not loaded, so there will be somewhat more memory
- available with [C]all than with [I]nterrupt.
- Specifications
-
- Text Capacity/Type
- ------------------
- Maximum text per database: 1 MB (500K when Record Option used)
- Maximum number of files: 254
- Maximum text per file: 1 MB (500K when Record Option used)
- Maximum number of drives/subdirectories: 254
- Preferred file type: ASCII file or Wordstar document
- Secondary type (usable with limitations):
- WordPerfect, Microsoft Word, others
- Allowable text characters: Any (binary 0 to binary FF)
- Maximum number of words beginning with the same character: 16384
-
- Search Capabilities
- -------------------
- Wild card: for suffixes (e.g. "comput?" gets
- computer, computing, etc.)
- Mix a wild-card and multi-word terms in the same
- search statement: No
- Boolean operators: AND, OR, NOT
- Proximity searches: By number of characters
- Maximum number of terms (operands) per search
- request: 2 (e.g. black cat and dog)
- Maximum number of operators per request: 1
- Maximum number of search options per request: 1
- Maximum number of words per search term: 20
- Maximum number of characters per word: 80
- Recombine previous search requests into more
- complex searches: Yes
- Maximum number of search requests per session: 100
- Review previous search requests: Yes
-
- Display Capabilities
- --------------------
- In-context text display: By number of lines,
- number of characters, or entire record
- Display instances of located text: one at a time
- or range of occurrences
- Highlight located text: First character (on most machines)
- Re-display previously located text: Yes
- View list of file names where word/phrase was located: Yes
- Browse across multiple files: Yes
- Maximum number of occurrences of a word: 1000
-
- Browsing Files
- --------------
- Page Up/Down: Yes
- Go to beginning/end of current file: Yes
- Go to beginning of next/previous file: Yes
- Go to beginning/end of entire textbase: Yes
- Paging options (e.g. PgDn) highlight
- search terms: No
-
- Optional Record Processing
- --------------------------
- Acceptable record separators: any character or
- string of characters
- Maximum number of lines per record: 255
- Maximum number of occurrences per keyword for which
- record information will be kept: 576
-
- Miscellaneous
- -------------
- View alphabetical list of indexed words: Yes
- Extraction of located text into new files: Yes,
- automatically or one text segment at a time
- Ability to suspend operation to execute DOS commands: Yes
- Ability to suspend operation to run other programs: Yes
- Function keys: optional - user definable
- Ability to use DOS PATH to locate files: Yes
-
- Indexing
- --------
- Index all words in specified files: Yes, except for
- user-modifiable (stop) list of words not to index
- Maximum number of words in the stop list: 500
-
- User Customization
- ------------------
- Re-define reserved words (e.g. and, or, not, ?): Yes
- Re-define concept of a "word" for special processing: Yes
- Set proximity between multiple words in a search term: Yes
-
-
-
- Idiosyncrasies/Bugs
- -------------------
- 1. The 'OR' search operation may
- produce a match count larger than what
- you might expect. That is, a search
- for:
- Clint Eastwood or Sondra Locke
-
- will count all of Eastwood's movies and
- all of Locke's movies -- thus counting
- some movies twice.
-
- 2. Indexer has been known to occasionally incorrectly
- compute a line location when the Record Option is used.
- That is, when you do a Finder search that is
- restricted by line, you may occasionally miss a
- record because its line number is off by 1 or
- got incorrectly set to 0. Also, keep in mind
- that only the first 500 occurrences for a keyword
- will have line information.
-
- Files
-
- Several files, besides your own text files, are
- required to run Finder. These may be in the current
- subdirectory or, unless pathchecking has been turned
- off, in any of the subdirectories specified with the
- DOS PATH function.
-
- Required files:
-
- profile file
- The file that contains the parameters and
- options that control both Indexer and
- Finder. Detailed information on the
- profile can be found in the appropriate
- sections of this manual and also in the
- sample files, PROFILE.xxx.
-
- F-FILES.ext
- The list of text files that have been
- indexed. You may edit this file to change
- the drive and directory designation of a
- file, or to add additional non-indexed
- files at the end. DO NOT remove file names
- or change the order of the files, or Finder
- will get confused.
-
- F-IX.ext
- A file containing the locations in the
- text of the keywords.
-
- F-IK.ext
- A file containing the keywords, the number
- of occurrences in the "database", and where
- in F-IX.ext those occurrences are.
-
- F-TAB.ext
- A control file with information about the
- other files.
-
- exclude file
- A list of words that are not searchable.
- This list should be built prior to using
- Indexer and should not be changed for
- Finder. It is an ordinary text file, with
- each word to be excluded placed on a line
- by itself.
-
- Optional files: (These are not necessary to run
- Finder.)
-
- F-DLMLOC
- If the Record option is being used, this
- file contains the location in the
- "database" of each record.
-
- F-FENCE1
- If the Record option is being used, this
- file contains the record offsets
- of each keyword.
-
- F-HELP
- The Help file. This is an ordinary text
- (ASCII) file that you can modify to meet
- your needs. See the Help section for more
- information.
-
- You do not need to have all of your text files
- online at all times. If you have only a
- floppy-based system, for instance, you can load your
- text files only as they're called for. If you do
- swap diskettes, however, you should write-protect
- the diskettes by covering the write-notch. This is
- to ensure that if Finder accidentally terminates and
- DOS tries to update the directory on your text
- diskettes, that it will not be able to do so.
- License
-
- Carefully read the following terms and
- conditions. If you do not agree with them, you
- should not use the package. Use of this software
- package indicates your acceptance of the terms and
- conditions.
-
- Eric Balkan, dba "Packet Press", hereinafter
- "Publisher", provides this software and licenses
- its use. Your use of the software acknowledges
- that you have read this agreement, understand it
- and agree to be bound by its terms and conditions.
-
- You may use this software for any purpose,
- subject to the following restrictions:
- 1 - You may copy the contents of this diskette for
- yourself or for others, but the entire diskette
- must be copied without change;
- 2 - You may charge for copies of this diskette, but
- not more than $8;
- 3 - You may not modify any of the files on this
- diskette without the permission of the Publisher;
- 4 - You may not incorporate this package, or any part
- of this package, in a commercial product, without
- prior arrangement with Packet Press. (Typically,
- we will ask for a nominal royalty fee.)
-
- This license is effective until terminated. You
- may terminate it at any time by destroying the
- software together with all copies. The license also
- terminates if you fail to comply with the terms and
- conditions of this agreement.
-
- This software and accompanying documentation are
- protected by United States copyright law and also
- by international treaty provisions.
-
- The publisher makes no warranties in regard to
- this software. No claim is made that functions
- contained in this software will meet your
- requirements or that the software operation will be
- uninterrupted or error free.
-
- In no event will the publisher be liable to you
- for any damages, including any lost profits, lost
- savings or other incidental or consequential
- damages arising out of the use or inability to use
- this software, or for any claim by any other party.
-
- This agreement shall be construed, interpreted
- and governed by the laws of the state of Maryland.
- you agree that this is the complete and exclusive
- statement of this agreement which supersedes any
- prior agreement or other communication between us on
- this subject.
- Registration
-
- The software on this diskette has been provided
- on an honor system "try-before-you-buy" basis.
- (If you paid less than $9 for this diskette,
- your payment was simply for a duplication/handling
- charge and for the diskette itself, not for its
- contents.)
-
- As much as we'd like to provide this software
- for free, its development took approximately 800
- hours. (That's 800 hours I could have spent making
- some money to meet my mortgage payments.) If you
- like AnyWord and would like to continue using it,
- then please pay for it. Of course, this is all on
- the honor system and we won't be checking up on
- anyone. However, if you do send us a check for
- $35, you will be supporting our past and future
- efforts in developing affordable information
- retrieval software. We'll also make it worth
- your while.
-
- By sending in your $35, you will automatically
- become a member of our AnyWord users group (AUG).
- Members of AUG get:
- - a $15 credit towards other products
- - a free DOSCARD
- - technical support for problems
- - assistance and advice with particular applications
- - programming support at a discount
- - free bug fixes
- - inexpensive update disks with new features
- - other benefits from time to time.
-
- We also give bonuses to members when
- they bring in other members.
-
- To join, just send us a check or money
- order, with your name and address, and the AnyWord
- serial number (which you can get by doing
- SHOWID at the DOS prompt -- this is how we credit
- whomever you got your disk from). If you think you
- will be distributing copies to others, give us your
- social security number (or other identifier) as well,
- so we can give you credit. (And run MAKEID to record
- your number on the disk.)
-
-
- Eric Balkan
- Developer/Publisher